Segment boundary detection via class entropy measurements in connectionist phoneme recognition

نویسندگان

چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Segment boundary detection via class entropy measurements in connectionist phoneme recognition

This article investigates the possibility to use the class entropy of the output of a connectionist phoneme recogniser to predict time boundaries between phonetic classes. The rationale is that the value of the entropy should increase in proximity of a transition between two segments that are well modelled (known) by the recognition network since it is a measure of uncertainty. The advantage of...

متن کامل

Connectionist Architectures for Multi-Speaker Phoneme Recognition

We present a number of Time-Delay Neural Network (TDNN) based architectures for multi-speaker phoneme recognition (/b,d,g/ task). We use speech of two females and four males to compare the performance of the various architectures against a baseline recognition rate of 95.9% for a single IDNN on the six-speaker /b,d,g/ task. This series of modular designs leads to a highly modular multi-network ...

متن کامل

a soft segment modeling approach for duration modeling in phoneme recognition systems

the geometric distribution of states duration is one of the main performance limiting assumptions of hidden markov modeling of speech signals. stochastic segment models, generally, and segmental hmm, specifically, overcome this deficiency partly at the cost of more complexity in both training and recognition phases. in this paper, a new duration modeling approach is presented. the main idea of ...

متن کامل

Improvements in the Stochastic Segment Model for Phoneme Recognition

The heart of a speech recognition system is the acoustic model of sub-word units (e.g., phonemes). In this work we discuss refinements of the stochastic segment model, an alternative to hidden Markov models for representation of the acoustic variability of phonemes. We concentrate on mechanisms for better modelling time correlation of features across an entire segment. Results are presented for...

متن کامل

Phoneme Boundary Detection using Deep Bidirectional LSTMs

In this paper we investigate the automatic detection of phoneme boundaries in audio recordings with the help of deep bidirectional LSTMs. This work is motivated by the needs of the project BULB which aims to support linguists in documenting unwritten languages. The automatic detection of phoneme boundaries in audio recordings of a new language is part of the technical requirements of the BULB p...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Speech Communication

سال: 2006

ISSN: 0167-6393

DOI: 10.1016/j.specom.2006.07.009